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(54) Image processing apparatus 

(57) In the image processing apparatus capable of 
performing flexible, high speed processing, a memory 
region emptied due to a change of display resolution can 
be used as a texture memory, increase ot an overhead 
such as switching pages can be prevented, and a de- 
cline in the performance is not caused. A built-in DRAM 
(147) inside a semiconductor chip (14) has a configura- 



tion for storing display data and the texture data required 
by at least one graphic element. The texture data can 
be stored in portions (147a) other than the display re- 
gions (147b), so the built-in DRAM (147) can be used 
efficiently and an image processing apparatus achiev- 
ing both high speed operation and a reduction of power 
consumption can be realized. 
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Description 

BACKGROUND OF THE INVENTION 
Field of the Invention 

[0001] The present invention relates to a graphic 
drawing image processing apparatus, more particularly 
relates to the technical field of storing texture data to be 
applied to a graphic element to be drawn in a built-in 
memory when a DRAM or other memory and a logic cir- 
cuit are provided together. 

Description of the Related Art 

[0002] Computer graphics are often used in a variety 
of computer aided design (CAD) systems and amuse- 
ment machines. Especially, along with the recent ad- 
vances in image processing techniques, systems using 
three-dimensional computer graphics are becoming 
rapidly widespread. 

[0003] In three-dimensional computer graphics, the 
color value of each pixel is calculated at the time of de- 
ciding the color of each corresponding pixel. Then, ren- 
dering is performed for writing the calculated value of 
the pixel to an address of a display buffer (frame buffer) 
corresponding to the pixel. 

[0004] One of the rendering methods is polygon ren- 
dering. In this method, a three-dimensional model is ex- 
pressed as a composite of triangular unit graphics (pol- 
ygons). By drawing the polygons as units, the colors of 
the display screen are decided. 

[0005] In polygon rendering, coordinates (x, y, z), 
color data (R, G. B ; ). homogeneous coordinates (s, t) of 
texture data indicating a composite image pattern, and 
a value of the homogeneous term q for the respective 
vertexes of the triangle in a physical coordinate system 
are input and processing is performed for interpolation 
of these values inside the triangle. 
[0006] Here, the homogeneous term q is, simply stat- 
ed, like an expansion or reduction rate. Coordinates in 
a UV coordinate system of an actual texture buffer, 
namely, texture coordinate data (u, v) are comprised of 
the homogeneous coordinates (s, t) divided by the ho- 
mogeneous term q to give "s/q" and "t/q" which in turn 
are multiplied by texture sizes USIZE and VSIZE, re- 
spectively. 

[0007] Figure 1 5 is a view of the system configuration 
showing a basic concept of a three-dimensional compu- 
ter graphic system. 

[0008] In this three-dimensional computer graphic 
system, data of drawing graphics is supplied from a 
main memory 2 in a main processor 1 or from an input/ 
output (I/O) interface circuit 3 for receiving graphic data 
from outside via a main bus 4 to a rendering circuit 5 
having a rendering processor 5a and a frame buffer 
memory 5b. 

[0009] In the rendering processor 5a, a frame buffer 



memory 5b for maintaining data for display and a texture 
memory 6 for maintaining texture data to be applied on 
the surface of a graphic element (for example, a triangle) 
to be drawn are connected. 
s [0010] Then, the rendering processor 5a performs 
processing for drawing the graphic element applied with 
the texture on its surface for every graphic element to 
the frame buffer memory 5b. 

[0011] The frame buffer memory 5b and the texture 
ib memory 6 are generally configured by a DRAM (dynam- 
ic random access memory). 

[001 2] In the system of Fig. 1 5, the frame buffer mem- 
ory 5b and the texture memory 6 are configured as phys- 
ically separated memory systems. 
15 [001 3] Note that, even after it became possible to pro- 
vide the DRAM and a logic circuit together to hold the 
texture data inside has been difficult due to both the limit 
of the DRAM capacity and the processing speed. 
[0014] In the so-called built-in DRAM system of the 
related art above, when the frame buffer memory and 
the texture memory are provided separately in separate 
memory systems, there are the disadvantages as stated 
below: 

[0015] First, the frame buffer memory which is emp- 
tied due to a change of the display resolution cannot be 
used as a texture memory. 

[0016] This becomes a big disadvantage when trying 
to perform all of the processing in the limited capacity of 
a built-in DRAM. 

[0017] Second, when making the frame buffer mem- 
ory and the texture memory physically identical, in si- 
multaneous access to the frame buffer memory and the 
texture memory, the overhead of switching pages of the 
DRAM etc. becomes large, so the performance has to 
be sacrificed. 

[0018] These disadvantages are not particularly lim- 
ited to a graphic drawing apparatus of a built-in DRAM 
type and have been disadvantages in systems of an ex- 
ternally provided DRAM type as well, but have not been 
as serious as the built-in type where the capacity is 
strictly limited. 

SUMMARY OF THE INVENTION 

[0019] An object of the present invention is to provide 
an image processing apparatus capable of performing 
flexible and high speed processing, wherein a memory 
region which is emptied due to a change of display res- 
olution can be used as a texture memory, increase of 
overhead such as switching pages can be prevented, 
and a reduction of the performance is not caused. 
[0020] To achieve the above object, according to a 
first aspect of the present invention, there is provided 
an image processing apparatus comprising a memory 
circuit for storing display data and texture data required 
by at least one graphic element and a logic circuit for 
performing processing for applying the texture data on 
the surface of the graphic element of the display data 
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based on the stored data of the memory circuit, the 
memory circuit and the logic circuit being both accom- 
modated in one semiconductor chip. 
[0021] According to a second aspect of the present 
invention, there is provided an image processing appa- 
ratus lor performing rendering by receiving polygon ren- 
dering data including three-dimensional coordinates (x, 
y, z), R (red), G (green), and B (blue), homogeneous 
coordinates (s, t) of a texture, and a homogeneous term 
q with respect to vertexes of a unit graphic, at least com- 
prising a memory circuit for storing display data and tex- 
ture data required by at least one graphic element; an 
interpolation data generation circuit for interpolating the 
polygon rendering data of vertexes of the unit graphic 
to generate interpolation data of pixels positioned inside 
the unit graphic; and a texture processing circuit for di- 
viding the homogeneous coordinates (s, t) of a texture 
included in the interpolation data by the homogeneous 
term q to generate "s/q" and Vq", using a texture ad- 
dress corresponding to the "s/q n and "t/q" to read texture 
data from the memory circuit, and applying the texture 
data on the surface of the graphic element of the display 
data; the memory circuit, the interpolation data genera- 
tion circuit, and the texture processing circuit being ac- 
commodated in one semiconductor chip. 
[0022] In the present invention, the memory circuit is 
divided into a plurality of modules having identical func- 
tions; and the logic circuit accesses the modules in par- 
allel. 

[0023] Further, in the present invention, display ele- 
ments at adjacent addresses in a display address space 
are arranged to be different memory blocks in the mem- 
ory circuit. 

[0024] Further, in the present invention, indexes in in- 
dex colors and values of a color look-up table for refer- 
ring to colors are stored in the memory circuit. 
[0025] Further, information of a depth of an object to 
be drawn is stored in the memory circuit. 
[0026] According to the present invention, by provid- 
ing a DRAM or other memory circuit and a logic circuit 
together in one semiconductor chip and by storing dis- 
play data and texture data required by at least one 
graphic element in a built-in memory circuit, the texture 
data can be stored in a portion other than a display re- 
gion, so the built-in memory can be efficiently used. 
[0027] Also, by giving identical functions in the mem- 
ory circuit in parallel as a plurality of independent mod- 
ules, the efficiency of parallel operations is improved. 
When the number of bits of the data is simply large, the 
efficiency of use ol data is deteriorated and improve- 
ment of the performance is limited to cases of certain 
conditions, but to improve the average performance, bit 
lines can be efficiently used by providing a plurality of 
modules having certain degrees of functions. 
[0028] Further efficient usage of the bit lines becomes 
possible by tinkering with the arrangement of the built- 
in memory circuit, that is, the address space occupied 
by the independent memory plus function modules. 



[0029] When there are frequent accesses to relatively 
fixed display regions as in drawing graphics, by arrang- 
ing the display elements at adjacent addresses to be 
different memory blocks from each other in the display 
5 address space, the probability increases of the modules 
being able to perform processing simultaneously and 
the drawing performance can be improved. Frequent 
accesses to a fixed display region, when drawing inside 
the closed region of a triangle etc., result in access to 

'10 adjacent addresses because the internal display ele- 
ments are adjacent to each other. 
[0030] Further, in order to store more texture data, the 
texture data can be compressed by storing indexes of 
index colors and values of a color look-up table therefor 

1$ inside the built-in memory circuit. As a result the built- 
in memory circuit can be efficiently used. 
[0031] Also, according to the present invention, by 
storing depth information of an object to be drawn in the 
built-in memory circuit, hidden plane processing can be 

so performed simultaneously and in parallel with the draw- 
ing. 

BRIEF DESCRIPTION OF THE DRAWINGS 

25 [0032] These and other objects and features of the 
present invention will become clearer from the following 
description of the preferred embodiments given with ref- 
erence to the accompanying drawings, in which: 

30 Fig. 1 is a block diagram of the configuration of a 
three-dimensional computer graphic system ac- 
cording to the present invention; 
Figs. 2A to 2C are views for schematically explain- 
ing methods of storing display data, depth data, and 
35 texture data in a DRAM according to the present 
invention; 

Fig. 3 is a block diagram of a specific example of 
the configuration of the memory l/F circuit for ac- 
cessing a DRAM, SRAM, and DRAM and SRAM in 
40 the rendering circuit according to the present inven- 
tion; 

Figs. 4A and 4B are schematic views of an example 
of the configuration of a DRAM buffer according to 
the present invention; 
45 Fig. 5 is a view for explaining simultaneously acces- 
sible pixel data included in the texture data. 
Fig. 6 is a view for explaining a unit block constitut- 
ing the texture data; 

Fig. 7 is a view for explaining the address space of 
so a texture buffer, 

Fig. 6 is a view for explaining the function of a DDA 
set-up circuit according to the present invention; 
Fig. 9 is a view for explaining the function of a trian- 
gle DDA circuit according to the present invention; 
55 Fig. 10 is a view for explaining the sorting process- 
ing of the vertexes of a triangle DDA circuit accord- 
ing to the present invention; 

Fig. 11 is a view for explaining the calculation 
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processing of an inclination in the horizontal direc- 
tion of a triangle DDA circuit according to the 
present invention; 

Figs. 12A and 12B are views for explaining a pro- 
cedure for interpolation of the vertex data of a trian- s 
gle DDA circuit according to the present invention; 
Fig. 1 3 is a flow chart for explaining the procedure 
for interpolation of the vertex data of a triangle DDA 
circuit according to the present invention; 
Fig. 14 is a view for explaining the image data 10 
processing of a distributer in a memory l/F circuit 
according to the present invention; and 
Fig. 15 is a view of the system configuration show- 
ing the basic concept of a three-dimensional com- 
puter graphic system. ?5 

DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

[0033] Below, preferred embodiments will be de- 20 
scribed with reference to the accompanying drawings. 
[0034] In the present embodiment, an explanation will 
be made of a three-dimensional computer graphic sys- 
tem for displaying a desired three-dimensional image of 
any three-dimensional object model at a high speed on 2s 
a display such as a CRT (cathode ray tube) - which is 
often used in personal computers etc. 
[0035] Figure 1 is a view of the system configuration 
of a three-dimensional computer graphic system 10 as 
an image processing apparatus according to the 30 
present invention. 

[0036] The three-dimensional computer graphic sys- 
tem 10 expresses a three-dimensional model as a com- 
posite of triangular unit graphics (polygons). By drawing 
the polygons, the colors of the pixels of the display 35 
screen are decided and the polygon rendering for dis- 
play on the display is performed. 

[0037] Also, in the three-dimensional computer 
graphic system 10, a three-dimensional object is ex- 
pressed by using a z-coordinate indicating a depth in , 40 
addition to the coordinates (x, y) indicating a position on 
a plane. The three coordinates (x, y, z) specify any one 
point in a three-dimensional space. 
[0038] As shown in Fig. 1 , the three-dimensional com- 
puter graphic system 1 0 is connected to a main proces- 45 
sor 11, a main memory 12, an I/O interface circuit 13, 
and a rendering circuit 14 via a main bus 15. 
[0039] Below, the functions of the respective compo- 
nents will be explained. 

[0040] The main processor 11 reads the necessary so 
graphic data from the main memory 1 2 in accordance 
with, for example, the state of progress of an application 
and performs clipping, lighting, and geometric process- 
ing etc. on the graphic data and generates polygon ren- 
dering data. The main processor 11 outputs the polygon 55 
rendering data S11 to the rendering circuit 14 via the 
main bus 15. 

[0041] The I/O interface circuit 13 receives as input 
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the motion control information or the polygon rendering 
data from the outside in accordance with need and out- 
puts the same to the rendering circuit 14 via the main 
bus 15. 

[0042] Here, the polygon rendering data includes the 
data (x, y, z, R, G, B ; s, t, q) of each of the three vertexes 
of the polygon. 

[0043] Here, the (x, y, z) data indicates the three-di- 
mensional coordinates of a vertex of the polygon, and 
(R, G, B) indicates the luminance values of red, green, 
and blue at the three-dimensional coordinates, respec- 
tively. 

[0044] Among the (s, t, q) data, the (s, t) indicates ho- 
mogeneous coordinates of a corresponding texture and 
the q indicates a homogeneous term. Here, the texture 
sizes USIZE and VSIZE are respectively multiplied with 
the "s/q" and M t/q B to obtain the texture coordinate data 
(u, v). The texture coordinate data (u, v) is used for ac- 
cessing the texture data stored in the texture buffer 
147a. 

[0045] Namely, the polygon rendering data consists 
of physical coordinate values of the vertexes of a trian- 
gle and colors and texture data of each of the vertexes. 
[0046] The rendering circuit 1 4 will be explained in de- 
tail below. 

[0047] As shown in Fig. 1, the rendering circuit 14 
comprises a DDA (digital differential analyzer) set-up 
circuit 141, a triangle DDA circuit 142, a texture engine 
circuit 143, a memory interface (l/F) circuit 144, a CRT 
control circuit 1 45, a RAMDAC circuit 1 46, a DRAM 1 47, 
and an SRAM (static RAM) 148. 
[0048] The rendering circuit 1 4 of the present embod- 
iment is provided with a logic circuit and a DRAM 147 
for storing at least display data and texture data together 
in one semiconductor chip. 

DRAM 147 

[0049] The DRAM 147 functions as a texture buffer 
147a, a display buffer 147b, a z-buffer 147c, and a tex- 
ture color look-up table (CLUT) buffer 147d. 
[0050] Also, the DRAM 147 is, as will be explained 
later on, divided into a plurality of modules (4 in this em- 
bodiment) having identical functions. 
[0051] Also, in the DRAM 147, in order to store more 
texture data, indexes in index colors and values for the 
color look-up table therefor are stored in the texture 
CLUT buffer 147d. 

[0052] The indexes and the values of the color look- 
up table are used for texture processing. Namely : a tex- 
ture element is normally expressed by the total 24 bits 
of the 8 bits of each of R, G, and B. With this, however, 
the amount of data swells, so one color is selected from, 
for example, 256 colors selected in advance and that 
data is used for the texture processing. As a result, with 
256 colors, the texture elements can be expressed by 
8 bits. A conversion table from the indexes to an actual 
color is necessary, however, the higher the resolution of 
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the texture, the more compact the texture data can be- 
come. 

[0053] Due to this, compression of the texture data 
becomes possible and the built-in DRAM can be used 
efficiently. 5 
[0054] Further, depth information of the object to be 
drawn is stored in the DRAM 147 in order to perform 
hidden plane processing simultaneously and in parallel 
with the drawing. 

[0055] Note that as a method of storing the display i'o 
data, the depth data, and the texture data, for example, 
the display data is stored at a predetermined position in 
the memory block, for example, continuously from the 
top, then the depth data is stored, and then the texture 
data is stored in continuous address spaces for each is 
type of texture in the remaining vacant region. 
[0056] Explaining this conceptually with reference to 
the drawings., as shown in Figs. 2A to 2C : the display 
data and the depth data are stored, for example, by a 
24-bit width in the region indicated by FB in the figure 20 
from a position indicated by a so-called base pointer 
(BP), while the texture data is stored in the remaining 
vacant region., that is, the 6-bit width region, as shown 
by TB in the figure. 

[0057] As a result, the texture data can be stored ef- 25 
ficiently. 

[0058] Figure 3 is a block diagram of a specific exam- 
ple of the configuration of the DRAM 147, the SRAM 
148, and the memory l/F circuit 144 accessing the 
DRAM 1 47 and the SRAM 1 48. 30 
[0059] As shown in Fig. 3, the DRAM 147 and the 
SRAM 148 shown in Fig. 1 are divided into four memory 
modules 200 : 210, 220, and 230, as explained above. 
[0060] The memory module 200 comprises memories 
201 and 202. 35 
[0061] The memory 201 comprises banks 201 A and 
201 B comprising a part of the DRAM 147 and banks 
201 C and 201 D comprising a part ofthe SRAM 148. 
[0062] Also, the memory 202 comprises banks 202A 
and 202B comprising a part of the DRAM 147 and the. 40 
banks 202C and 202D comprising a part of the SRAM 
148. 

[0063] Note that the banks 201 C, 201 D, 202C, and 
202D comprising the SRAM 1 48 are simultaneously ac- 
cessible. 45 
[0064] The memory module 210 comprises memories 
211 and 21 2. 

[0065] The memory 211 comprises banks 211 A and 
21 1B comprising a part of the DRAM 147 and banks 
211C and 21 1D comprising a part of the SRAM 148. so 
[0066] Also, the memory 212 comprises banks 21 2A 
and 212B comprising a part of the DRAM 147 and banks 
21 2C and 21 2D comprising a part of the SRAM 143. 
[0067] Note that the banks 211C, 211D, 212C, and 
21 2D comprising the SRAM 1 48 are simultaneously ac- S5 
cessible. 

[0068] The memory module 220 comprises memories 
221 and 222. 
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[0069] The memory 221 comprises banks 221 A and 
221 B comprising a part of the DRAM 147 and banks 
221 C and 221 D comprising a part of the SRAM 1 48. 
[0070] Also, the memory 222 comprises banks 222A 
and 222B comprising a part of the DRAM 1 47 and banks 
222C and 222D comprising a part of the SRAM 148. 
[0071] Note that the banks 221C, 221 D, 222C, and 
222D comprising the SRAM 148 are simultaneously ac- 
cessible. 

[0072] The memory module 230 comprises memories 
231 and 232. 

[0073] The memory 231 comprises banks 231 A and 
231 B comprising a part of the DRAM 147 and banks 
231 C and 231 D comprising a part of the SRAM 148. 
[0074] Also, the memory 232 comprises banks 232 A 
and 232B comprising a part of the DRAM 1 47 and banks 
232C and 232D comprising a part of the SRAM 148. 
[0075] Note that the banks 231 C, 231 D, 232C, and 
232D comprising the SRAM 148 are simultaneously ac- 
cessible. 

[0076] Here, each of the memory modules 200, 210, 
220, and 230 has all of the functions of the texture buffer 
1 47a, the display buffer 1 47b, the z-buffer 1 47c, and the 
texture CLUT buffer 147d shown in Fig. 1. 
[0077] Namely, each of the memory modules 200. 
210, 220, and 230 stores all of the texture data, the 
drawing data ((R, G, B) data), the z-data, and the texture 
color look-up table data of corresponding pixels. 
[0078] Note that the memory modules 200, 21 0, 220, 
and 230 store data of different pixels from each other. 
[0079] Here, the texture data, the drawing data, the z- 
data, and the texture color look-up table data for 16 pix- 
els being processed simultaneously are stored in mutu- 
ally different banks 201 A, 201 B, 202 A, 202B, 211 A, 
211B, 212A, 212B, 221A, 221B, 222A, 222B, 231A, 
231 B, 232A, and 232B. 

[0080] Due to this, the memory !/F circuit 1 44 can si- 
multaneously access the DRAM 147 for data of, for ex- 
ample, 2x8 pixels, that is, 16 pixels. 
[0081] Note that the memory l/F circuit 1 44, as wilt be 
explained later on, accesses (writes) the DRAM 147 
based on so-called predetermined interleave type ad- 
dressing. 

[0082] Figures 4A and 4B are schematic views of an 

example of the configuration of the DRAM 147 serving 

as a buffer (for example, a texture buffer). 

[0083] As shown in Figs. 4A and 4B, data accessed 

in a 2 x 8 pixel region is stored in a region indicated by 

a page (row) and a block (column). 

[0084] Each of the rows ROW0 to ROWn+1 are, as 

shown in Fig. 4A, respectively divided into 4 columns 

(blocks) MOA, MOB, MIA, and M1B. 

[0085] Accesses (writing and reading) are performed 

in regions of a boundary of 3 pixels each in an x-direction 

and of an even number of boundaries in a y-direction. 

[0086] Due to this, access is not made to a region 

straddling, for example, the row ROW0 and the row 

ROW1 , so that so-called page violation does not occur. 
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[0087] Note that the texture data respectively stored 
in the banks 201 A, 201 B. 202A, 202B, 211 A, 211 B, 
21 2A, 21 2B, 221 A, 221 B, 222A, 222B, 231 A, 231 B, 
232A, and 232B are stored in the banks 201 C, 201 D, 
202C, 202D, 21 1C, 21 1D, 21 2C, 21 2D, 221 C, 221 D, 
222C, 222D, 231 C, 231 D, 232C, and 232D. 
[0088] Next, storage patterns of the texture data in the 
texture buffer 147a based on the interleave type ad- 
dressing will be explained in further detail with reference 
to Figs. 5 to 7. 

[0089] Figures is a viewfor explaining simultaneously 
accessible pixel data included in the texture data, Fig. 
6 is a view for explaining unit blocks comprising texture 
data, and Fig. 7 is a view for explaining an address 
space of the texture buffer. 

[0090] In the present embodiment, as shown in Fig. 
5, pixel data P 0 to P 15 included in the texture data, ar- 
ranged in a 2 x 8 matrix, and indicating color data of 
pixels are simultaneously accessible. 
[0091] The pixel data P 0 to P 15 have to be stored in 
different banks of the SRAM 148 comprising the texture 
buffer 147a. 

[0092] In the present embodiment, the pixel data P 0 , 
Pt , P 8 , and P 9 are respectively stored in the banks 201 C 
and 201 D of the memory 201 and banks 202C and 202D 
of the memory 202 shown in Fig. 3. Also, the pixel data 
P 2 , P3, P-jo» ar| d P11 are respectively stored in the banks 
21 1 C and 21 1D of the memory 211 and banks 21 2C and 
21 2D of the memory 212 as shown in Fig. 3. The pixel 
data P 4 , P s , P 12 , and P 13 are respectively stored in the 
banks 221C and 221 D of the memory 221 and banks 
222C and 222D of the memory 222 as shown in Fig. 3. 
Further, the pixel data P 6 , P 7 , P 14 , and P 15 are respec- 
tively stored in the banks 231 C and 231 D of the memory 
231 and the banks 232C and 232D of the memory 232 
as shown in Fig. 3. 

[0093] I n the present embodim ent ; the set of p ixel da- 
ta P 0 to P 15 of the pixels positioned inside a block region 
and simultaneously processed is called a unit block R } . 
For example, texture data indicating one image is com- , 
posed of unit blocks R 0 to R B a-i arranged in a B x A 
matrix, as shown in Fig. 6. 

[0094] The unit blocks R 0 to RBA-1 are, as shown in 
Fig. 7, stored in the DRAM 147 comprising the texture 
buffer 147a so as to have continuous addresses in a 
one-dimensional address space. Also, the pixel data P 0 
to P 15 in each of the unit blocks R 0 to R BA .-, are stored 
in different banks from each other in the SRAM 143 so 
as to have continuous addresses in a one-dimensional 
address space. 

[0095] Namely, a unit block composed of pixel data 
which are simultaneously accessible is stored in the tex- 
ture buffer 147a so as to have continuous addresses in 
a one-dimensional address space. 

DDA Set-up Circuit 141 

[0096] Prior to the succeeding triangle DDA circuit 



142 performing linear interpolation on the values of the 
vertexes of a triangle on a physical coordinate system 
to obtain information of the color and depth of the pixels 
inside the triangle, the DDA set-up circuit 141 performs 
5 a set-up operation for obtaining the difference of a side 
of the triangle from a horizontal direction etc. for the data 
(z, R, G ; B, s, t, q) indicated by the polygon rendering 
data S11. 

[0097] Specifically, this set-up operation uses values 
10 of a starting point and an end point and the distance 
between the two points to calculate the change ol values 
to be sought in the case of movement by a unit length. 
[0098] The DDA set-up circuit 141 outputs the calcu- 
lated change data S141 to the triangle DDA circuit 142. 
15 [0099] The function of the DDA set-up circuit 141 will 
be further explained with reference to Fig. 8. 
[0100] As explained above, the main processing of 
the DDA set-up circuit 1 41 is to obtain the change inside 
a triangle composed of three vertexes given various in- 
20 formation (color and textu re coordinates) at vertexes re- 
duced to physical coordinates through the former geo- 
metric processing so as to calculate basic data for the 
later linear interpolation. 

[0101] Note that the data of each vertex of the triangle 
25 js, for example, configured by 16 bits of x- and y-coor- 

dinates, 24 bits of the z-coordinate, 12 bits (=8+4) of the 

color values for the RGB, and 32 bits of floating decimal 

values (IEEE format) of the s, t, q texture coordinates. 

[0102] While the drawing of a triangle is reduced to 
30 the drawing of a horizontal line, this makes it necessary 

to obtain the starting values at the starting point of the 

drawing of the horizontal line. 

[0103] In drawing the horizontal line, the direction of 
drawing is made constant in one triangle. For example, 

35 when drawing from the left to the right, the X with respect 
to a displacement in the y-direction of a side on the left 
and the above various changes are calculated first, then 
these are used to find the X-coordinate of the left-most 
point when moving from a vertex to the next horizontal 

to line and values of the above various information (points 
on a side change in both the X-and y-directions, so cal- 
culation is impossible only from the inclination of the Y- 
direction). 

[0104] Only the position of the end point is required 
^5 for the side on the right, so only the change of x with 
respect to the displacement in the y-direction need be 
investigated. 

[0105] Regarding the drawing of a horizontal line, 
since the inclination in the horizontal direction is uniform 
so in the same triangle : the inclinations of the above vari- 
ous information are calculated, 

[0106] The given triangle is sorted in the y-direction 
and the upper-most point is set to be A. Next, the re- 
maining two vertexes are compared in terms of the po- 
55 sitions in the X-direction and the point on the right is set 
to be B. By doing this, the processing can be divided 
into only two or so steps. 
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Triangle PDA Circuit 142 

[0107] The triangle DDA circuit 142 uses the change 
data S141 input from the DDA set-up circuit 141 to cal- 
culate the linearly interpolated (z ; R, G : B, s ; t, q) data 
at each pixel inside the triangle. 

[0108] The triangle DDA circuit 142 outputs the data 
(x, y) for each pixel and the (z. R, G, B, s, t, q) data at 
the (x, y) coordinates to the texture engine circuit 143 
as the DDA data (interpolation data) S142. 
[01 09] For example, the triangle DDA circuit 1 42 out- 
puts the DDA data S142 of the 8 (= 2 x 4) pixels posi- 
tioned inside a block being processed in parallel to the 
texture engine circuit 143. 

[011 0] A further explanation will be made of the func- 
tion of the triangle DDA circuit 1 42 with reference to Fig. 
9. 

[0111] As explained above, inclination information of 
the above various information of the sides and horizon- 
tal direction of a triangle is prepared by the DDA set-up 
circuit 141. The basic processing of the triangle DDA 
circuit 142 receiving this information consists of the cal- 
culation of the initial values of the horizontal line by in- 
terpolation of the various information on the sides of the 
triangle and the interpolation of the various information 
on the horizontal line. 

[0112] Here, what must be noted most is that the cal- 
culation of results of the interpolation requires calcula- 
tion of the values at the center of a pixel. 
[0113] The reason is that if the value calculated is off 
from the center of the pixel, while there is not much to 
worry about in the case of a still picture, the flickering of 
the image will stand out in a motion picture. 
[0114] The various information at the left-most side of 
a first horizontal line can be obtained by multiplying the 
inclination on the side with the distance from the vertex 
to the first line. 

[0115] The various information at the starting point of 
the next line can be calculated by adding the inclination 
of the side. 

[0116] The value at the first pixel of the horizontal line 
can be calculated by adding the value obtained by mul- 
tiplying the distance to the first pixel with the inclination 
in the horizontal direction to the value at the starting 
point of the line. The value at the next pixel of the hori- 
zontal line can be calculated by adding to the first pixel 
value the inclination in the horizontal direction succes- 
sively. 

[0117] Next, sorting of vertexes will be explained with 
reference to Fig. 10. 

[01 1 8] By sorting the vertexes in advance, the branch- 
ing of the successive processing can be reduced to a 
minimum and contradictions can be made harder to oc- 
cur inside one triangle as much as possible even in in- 
terpolation. 

[0119] As the method of sorting, first, all of the vertex- 
es supplied are sorted in the y-direction and the upper- 
most point and the lower-most point are defined as the 



point A and point C, respectively. The remaining point 
is defined as the point B. 

[0120] By doing so, in the processing, the side extend- 
ing the longest in the Y-direction becomes a side AC. 
5 First, the side AC and the side AB are used for the in- 
terpolation of the region between the two sides, then in- 
terpolation is performed for the region between the side 
BC and the side AC, that is, leaving the side AC as it is 
and changing from the side AB. Also, it will be under- 
go stood that it is sufficient to perform processing with re- 
spect to the side AC and the side BC for correction on 
the pixel coordinate lattice in the Y-direction. 
[0121] Since branching of the processing after sorting 
becomes unnecessary in this way, the processing can 
15 be performed simply by supplying the data, bugs can be 
prevented from occurring, and the configuration be- 
comes simple. 

[0122] Also, since the direction of the interpolation in 
one triangle can be made constant by setting a starting 

20 point on the side BC, the direction of interpolation (span) 
in the horizontal direction becomes constant and any 
computation errors which occur are accumulated from 
the side BC to other sides. Since the direction of the 
accumulation becomes constant, errors between adja- 

25 cent sides become less conspicuous. 

[0123] Next, the calculation of the inclination in the 
horizontal direction will be explained with reference to 
Fig. 11. 

[0124] The inclination (variables) of the variables (x, 
30 y, z, R, G, 8, s, t, q) inside a triangle with respect to (x, 
y) becomes constant due to the linear interpolation. 
[0125] Accordingly the inclination in the horizontal di- 
rection, that is, the inclination on each of the horizontal 
lines (span), becomes constant for all spans, so the in- 
35 clinatton is obtained prior to the processing of the spans. 
[0126] As a result of sorting the given vertexes of the 
triangle in the Y-direction, the side AC is defined again 
to be the longest extending side, so there is always a 
point of intersection of a line extending from the vertex 
*o B in the horizontal direction and the side AC. The point 
is defined as D. 

[0127] After this, by just obtaining the change be- 
tween the point Band D, the inclination in the horizontal 
direction, that is, in the x-direction, can be obtained. 
^5 [0128] Specifically, the x- and z-coordinates at the 
point D becomes as shown in the equations (1) below. 

X d= {(yd-yaWVc-yaJHXc-Xa) 

50 

z d = {(Vd-yaViyc-y.JHv 2 .) 0) 

[0129] When obtaining the inclination of the variable 
z in the x-direction based on this, the following is ob- 
55 tained: 
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Az/Ax =(Zd-Zb)/(x<i-Xb) 

=[ { (yd-y a )/(y c -ya) } -(zc-^-Zb]/ 
[{(y<ryt)/(yc-yO}-(xc-xa)-Xb] 
= {2b(y c -y a )-(zc-z a Xyc-y a ) } / 
{xb(yc-ya)-(zc-za)(yc-ya)} 

[0130] Next, an example of the routine for interpola- 
tion of vertex data will be explained with reference to 
Figs. 12A : 12B and 13. 

[0131] After the processing for sorting the vertexes, 
calculating the inclination in the horizontal direction, and 
calculating the inclination on each of the sides, interpo- 
lation is carried out using the results. 
[0132] Depending the position of the point B, the 
processing at a span splits in two directions. This is be- 
cause it is desired to perform the processing by always 
using the side extending the longest in the y-direction 
as a starting point so as to try to prevent trouble as much 
as possible by making the direction of accumulation of 
errors between respective spans in interpolation inside 
one triangle constant. 

[0133] When the point B is at the same height as the 
point A, the first half of the processing is skipped. There- 
fore, the processing can be streamlined by just providing 
a skippable mechanism rather than branching. 
[0134] When trying to improve the processing capa- 
bility by simultaneously processing a plurality of spans, 
it is desired to obtain the inclination in the Y-direction, 
however it is necessary to carry out the processing 
again from the sorting of the vertexes. However, the 
processing before the interpolation processing is 
enough, so the processing system as a whole becomes 
simpler. 

[0135] Specifically, when point B is not the same 
height as the point A, Y-direction correction of AC and 
AB (calculation of values on a pixel lattice) is performed 
(ST1 and ST2) and the interpolation on the side AC and 
the interpolation on the side AB are performed (ST3). 
[0136] Then, the correction in the AC horizontal direc- 
tion and the interpolation on the horizontal line from the 
side AC in the side AB direction (ST4) are carried out. 
The above processing of steps ST3 and ST4 are per- 
formed until the end of the side AB (ST5). 
[01 37] When the processing of steps ST2 to ST4 until 
the end of the side AB is completed or when it is judged 
at step ST1 that the point B is the same height as the 
point A, the y-direction correction of BC (calculation of 
values on the pixel lattice) is carried out (ST6) and the 
interpolation on the side AC and the interpolation on the 
side BC are carried out (ST7). 

[01 38] Then : the correction in the AC horizontal direc- 
tion and the interpolation on the horizontal line (span) 
are carried out (ST8). 

[0139] The processing of the above steps ST7 and 



ST8 is carried out until the end of the side BC (ST9). 
Texture Engine Circuit 143 

s [0140] The texture engine circuit 143 performs the 
calculation of "s/q" and °t/q n , the calculation of the tex- 
ture coordinate data (u, v), and the reading of the (R, G, 
B) data from the texture buffer 1 47a in a pipeline format. 
[0141] Note that the texture engine circuit 143 per- 

io forms the processing on the 8 pixels positioned inside 
a predetermined block simultaneously in parallel. 
[0142] The texture engine circuit 143 performs the op- 
eration for dividing the data s by the data q and the op- 
eration for dividing the data t by the data q on the (s, t, 

is q) data indicated by the D DA data S142. 

[0143] The texture engine circuit 143 is provided with, 
for example : eight division circuits, not illustrated, and 
performs the division "s/q" and "t/q" simultaneously on 
the 8 pixels. 

20 [01 44] Also, the textu re engine circuit 1 43 respective- 
ly multiplies the texture sizes USIZE and VSIZE with the 
division results "s/q" and "t/q" to generate the texture 
coordinate data (u, v). 

[0145] The texture engine circuit 143 outputs a read 
2S request including the generated texture coordinate data 
(u , v) to the SRAM 1 48 or DRAM 1 47 via the memory 1/ 
F circuit 144. The texture engine circuit 143 obtains the 
(R, G, B) data SI 48 stored at the texture address cor- 
responding to the (s, t) data by reading the texture data 
30 stored in the SRAM 1 48 or in the texture buffer 1 47a via 
the memory t/F circuit 144. 

[0146] Here, the texture data stored in the texture 
buffer 147a is stored in the SRAM 148 as explained 
above. 

35 [0147] The texture engine circuit 143 generates pixel 
data SI 43 by combining etc. the (R, G, B) data in the 
read (R, G, B) data S1 48 and the (R, G, B) data included 
in the DDA data S142 from the former triangle DDA cir- 
cuit 142 . 

40 [0148] The texture engine circuit 1 43 outputs the pixel 
data SI 43 to the memory l/F circuit 144. 
[0149] Note that in the texture buffer 147a, MIPMAP 
(texture for a plurality of resolutions) and other texture 
data corresponding to a plurality of reducing rates is 

45 stored. Here, which reducing rate texture data to use is 
determined in the above triangular units using a prede- 
termined algorithm. 

[0150] In the case of a full color mode, the texture en- 
gine circuit 143 directly uses the (R. G, B)data read from 

so the texture buffer 147a. 

[0151] In the case of an index color mode, the texture 
engine circuit 143 reads a color look-up table (CLUT), 
prepared in advance, from the texture CLUT buffer 
147d, transfers and stores the same in the built-in 

55 SRAM, and uses the color look-up table to obtain the 
(R, G, B)data corresponding to the color index read from 
the texture buffer 147a. 
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Memory l/F Circuit 144 

[01 52] The memory l/F circuit 1 44 compares the z-da- 
ta corresponding to the pixel data S143 input from the 
texture engine circuit 143 with the z-data stored in the s 
z-buffer 147c and judges whether the image drawn by 
the input pixel data S1 43 is positioned closer to the view- 
ing point than the image written in the display buffer the 
previous time. When it is judged that the image drawn 
by the input pixel data S143 is positioned closer, the Vo 
memory l/F circuit 144 updates the z-data stored in the 
z-buffer 1 47c by the z-data corresponding to the image 
data S143. 

[0153] Also, the memory l/F circuit 144 writes the (R, 
G, B) data to the display buffer 147b. 15 
[01 54] Furthermore, the memory VF circuit 1 44 reads 
the (Ft, G, B) data SI 48 stored in the SRAM 148 when 
receiving a read request including the generated texture 
coordinate data (u, v) from the texture engine circuit 143 
to the SRAM 148. 20 
[0155] Also, when receiving a request for reading dis- 
play data from the CRT control circuit 145, the memory 
l/F circuit 1 44 reads a certain amount of the display data, 
for example, in units of S pixels or 16 pixels, from the 
display butler 147b in response to the request. 25 
[0156] The memory l/F circuit 144 performs access- 
ing (writing or reading) to the DRAM 1 47 and to the 
SRAM 148 in this way, but the write path and a read path 
are structured as separate paths. 

[0157] Namely, when writing, a write address ADRW 30 
and write data DTW are processed in the write system 
and written to the DRAM 147, while when reading, 
processing is carried out in the read system to read from 
the DRAM 147 or the SRAM 148. 

[0158] The memory l/F circuit 144 performs access- 35 
ing to the DRAM 147 based on a predetermined inter- 
leave type addressing in, lor example, 16-pixel units. 
Below, an example of a specific configuration of the 
memory l/F circuit 144 will be explained with reference 
to Fig. 3. r 40 

[0159] The memory l/F circuit 144 comprises, as 
shown in Fig. 3, a distributer 300, address converters 
31 0, 320, 330, and 340, memory controllers 350, 360, 
370, and 380, and a read controller 390. 
[0160] At the time of writing, the distributer 300 re- 4S 
ceives as input 16 pixels worth of (R, G, B) data DTW 
and a write address ADRW, divides them into four image 
data S301, S302, S303, and S304, and outputs the im- 
age data and the write address to the address convert- 
ers 310, 320, 330.. and 340. so 
[0161] Here, the (R : G, B) data for one pixel is com- 
posed of eight bits and the z-data is composed of 32 bits. 
[0162] When writing, the address converters 310, 
320, 330, and 340 convert the addresses corresponding 
to the (R, G, B) data and the z-data input from the dis- ss 
tributer 300 to addresses in the memory modules 200, 
21 0, 220, and 230 and output the respectively converted 
addresses S31 0, S320, S330, and S340 and the divided 
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image data to the memory controllers 350, 360, 370 : and 
3S0. 

[01 63] Figure 1 4 is a schematic view of the image data 
processing (pixel processing) of the distributer 300. 
[01 64] This figure corresponds to the above Figs. 4 to 
7. The distributer 300 performs image data processing 
for the DRAM 147 in order that data of, for example, 2 
x 8 pixels, that is, 16 pixels, can be simultaneously ac- 
cessed. 

[0165] Then, the distributer 300 performs image data 
processing in order to give addressing for access (writ- 
ing, reading) to a region of a boundary of 8 pixels in the 
x-direction and an even number of boundaries in the y- 
direction. 

[01 66] As a result, in the DRAM 1 47, the top of access 
always becomes a memory celt number MCN "0°, not 
"1 ", "2°, or "3", so occurrence of page violations etc. can 
be prevented. 

[0167] Also, the distributer 300 performs image data 
processing for the DRAM modules 220 to 230 in order 
that adjacent portions in a display region are arranged 
at different DRAM modules. 

[0168] As a result, when drawing a plane such as a 
triangle, simultaneous processing on the plane is pos- 
sible, so the probability ol operation of the respective 
DRAM modules becomes very high. 
[0169] The memory controllers 350, 360, 370, and 
3S0 are respectively connected to the memory modules 
200, 210, 220, and 230 via a wiring group of the write 
system 401 W, 402W, 411 W, 412W, 421 W, 422W, 431 W, 
and 432W and a wiring group of the read system 401 R, 
402R, 411R S 412R, 421R, 422R, 431R; and 432R and 
control access to the memory modules 200, 210, 220, 
and 230 at the time of writing and reading. 
[0170] Specifically, when writing, the memory control- 
lers 350, 360, 370, and 380 are output from the distrib- 
uter 300, and the (R, G, B) data and z-data for 4 pixels 
input from the address converters 350, 360, 370, and 
380 are simultaneously written to the memory modules 
200, 210, 220, and 230 via the wiring group of the write 
system 401 W, 402 W, 41 1 W, 41 2W, 42 1 W ( 422 W, 431 W, 
and 432W. 

[0171] At this time, for example, in the memory mod- 
ule 200, the (R, G, B) data and z-data for 1 pixel are 
stored in each of the banks 201 A, 201 B, 202A, and 
202B. The same is true for the memory modules 210, 
220, and 230. 

[0172] Also, the memory controllers 350, 360, 370, 
and 380 output idle signals S350, S360 : S370, and S380 
to the read controller 390 as active when their own state 
machines are in an idle state. Receiving read addresses 
and a read request signal S391 by the read controller 
390 in response to the idle signals S350, S360, S370, 
and S360, the memory controllers 350, 360, 370, and 
3S0 read data via the wiring group of the read system 
401 R. 402R, 41 1 R, 41 2R, 421 R, 422R, 431 R, and 432R 
and output the same to the read controller 390 via the 
wiring group of the read system 351 , 361 , 371 , and 381 
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and a wiring group 440. 

[0173] Note that, in this embodiment, the number of 
wires of the wiring group of the write system 401 W, 
402 W, 411 W, 412VV. 421 W, 422W, 431 W, and 432W 
and the wiring group of the read system 401 R, 402R, s 
411R, 412R, 421R, 422R, 431R, and 432R is 128 (128 
bits). The number of wires of the wiring groups 351, 361, 
371 , and 381 of the read system is 256 (256 bits). The 
number of wires of the wiring group 440 of the read sys- 
tem is 1024 (1024 bits). io 
[01 74] The read controller 390 is configured by an ad- 
dress converter 391 and a data processing portion 392. 
[0175] When receiving a read address ADRR, if the 
address converter 391 receives all of the idle signals 
S350, S360, S370 s and S380 from the memory control- is 
lers 350, 360, 370, and 380 as active, the address con- 
verter 391 outputs read addresses and a read request 
signal S391 to the memory controllers 350 : 360, 370, 
and 380 in order to read data in units of 8 pixels or 16 
pixels in response to the idle signals S350, S360, S370, 20 
and S380. 

[0176] The data processing portion 392 receives as 
input the texture data, the (R, G, B) data, the z-data, and 
the texture color look-up table data in units of 8 pixels 
or 1 6 pixels read from the memory controllers 350, 360, 25 
370, and 380 via the wiring group 440, performs a pre- 
determined processing, and outputs the result to a re- 
quested destination, for example, to the texture engine 
circuit 143 or the CRT control circuit 145. 
[0177] As explained above, when all of the memory 30 
controllers 350 : 360, 370, and 380 are in an idle state, 
the read controller 390 outputs the read addresses and 
read request signal S391 to the memory controllers 350, 
360, 370, and 380 and receives the read data, therefore 
can synchronize the read data. 35 
[0178] Accordingly, the read controller 390 does not 
require a first-in first-out (FIFO) circuit or other holding 
circuit for temporarily holding data. Thus, the size of the 
circuit can be reduced. 

40 

CRT Control Circuit 145 

[0179] The CRT control circuit 145 generates an ad- 
dress for display on a not shown CRT in synchronization 
with given horizontal and vertical synchronization sig- 4 $ 
nats and outputs a request for reading the display data 
from the display buffer 147b to the memory l/F circuit 
144. In response to this request, the memory t/F circuit 
144 reads a certain amount of the display data from the 
display buffer 1 47b. The CRT controller 1 45 has a built- so 
in FIFO circuit for storing the display data read from the 
display buffer 1 47b and outputs the index value of RGB 
to the RAMDAC circuit 146 at certain time intervals. 

RAMDAC Circuit 146 55 

[0180] The RAMDAC circuit 146 stores the R, G, B 
data corresponding to the respective index values. It 



transfers R, G, B data of a digital format corresponding 
to the index value of RGB input from the CRT controller 
circuit 145 to a not illustrated digital/analog (D/A) con- 
verter to generate R, G. B data o1 an analog format. The 
RAMDAC circuit 146 outputs the generated R, G : Bdata 
to the CRT. 

[0181] Next, the operation resulting from the above 
configuration will be explained. 

[0182] In the three-dimensional computer graphic 
system 10, data of drawing graphics etc, are supplied 
from the main memory 12 of the main processor 11 or 
from the I/O interface circuit 1 3, which receives graphic 
data from the outside, to the rendering circuit 14 via the 
main bus 15. 

[0183] Note that the data for drawing graphics etc. 
are, in accordance with need, subjected to coordinate 
conversion, clipping, lighting, and other geometric 
processing in the main processor 11 etc. 
[0184] The graphic data after the geometric process- 
ing becomes polygon rendering data S11 composed of 
the coordinates x, y, z of the respective three vertexes 
of a triangle, the luminance values R, G, B. the texture 
coordinates s, t, q corresponding to the pixel to be 
drawn, etc. 

[0185] The polygon rendering data S11 is input to the 
DDA set-up circuit 141 of the rendering circuit 14. 
[0186] In the DDA set-up circuit 141, change data 
Si 41 indicating the difference of a side of the triangle 
from a horizontal direction etc. is generated based on 
the polygon rendering data S11. Specifically, the 
change, that is, the amount of change of the values to 
be obtained in the case of movement by a unit length, 
is calculated by using the starting point, the end point, 
and the distance between the two and output to the tri- 
angle DDA circuit 1 42 as change data S1 41 . 
[0187] In the triangle circuit 142, the change data 
SI 41 is used for calculation of the linearly interpolated 
data (z, R, G, B, s, t, q) of each of the pixels inside the 
triangle. 

[0188] Then, the calculated (z, R, G, B, s, t, q) data 
and the (x t y) data of each of the vertexes of the triangle 
are output from the triangle DDA circuit 142 to the tex- 
ture engine circuit 143 as DDA data S142. 
[0189] In the texture engine circuit 143, the operation 
for dividing the data s by the data q and the operation 
for dividing the data t by data q are performed for the (s, 
t : q) data indicated by the DDA data S142. Then, the 
texture sizes USIZE and VSIZE are respectively multi- 
plied with the division results n s/q" and "t/q" to generate 
the texture coordinate data (u, v). 
[0190] Next, a read request including the above gen- 
erated texture coordinate data (u, v) is output from the 
texture engine circuit 143 to the memory l/F circuit 144. 
Then, the (R : G, B) data S143 stored in the SRAM 143 
is read via the memory l/F circuit 144. 
[0191] Next, in the texture engine circuit 143, the (R, 
G, B) data in the read (R, G, B) data Si 48 and the (R, 
G, B) data included in the DDA data S142 from the tri- 



10 



19 



EP 0 940 772 A2 



angle DDA circuit 1 42 in the former stage are combined 
to generate the pixel data S143. 
[0192] This pixel data Si 43 is output from the texture 
engine circuit 143 to the main memory i/F circuit 144. In 
the case of a full color mods, the (R, G, B) data read 
from the texture buffer 147a can be directly used, while 
in the case of an index color mode, data in the color look- 
up table (CLUT), prepared in advance, is transferred 
from the texture CLUT buffer 147d to a buffer for tem- 
porary storage, which is configured by an SRAM etc. By 
using the CLUT of the temporary storage buffer, the ac- 
tual R, G, B colors can be obtained from the color index. 
[0193] Note that when the CLUT is configured by an 
SRAM, when a color index is input to an address of the 
SRAM, the actual R, G, B colors are output. 
[0194] In the memory l/F circuit 144, the z-data cor- 
responding to the pixel data S1 43 input from the texture 
engine circuit 143 and the z-data stored in the z-buffer 
147c are compared for judging whether the image 
drawn by the input pixel data S143 is positioned closer 
to the viewing point than the image written in the display 
buffer the previous time. 

[0195] When it is judged that the image drawn by the 
input pixel data S143 is positioned closer, the z-data 
stored in the buffer 147c is replaced by the z-data cor- 
responding to the image data S143. 
[0196] Next, in the memory l/F circuit 144, the (R, G, 
B) data is written in the display buffer 1 47b. 
[0197] The data to be written (including updated) is 
supplied to the memory controllers 350, 360, 370, and 
380 via the distributer 300 and the address decoders 
310, 320, 330, and 340, which are write system circuits, 
and written to predetermined memories in parallel by the 
memory controllers 350 : 360, 370, and 380 via the wir- 
ing groups of the write system 401 W, 402W, 41 1W, 
41 2 W, 421 W, 422W, 431 W, and 432W. 
[0198] In the memory l/F circuit 144, the memory 
block storing the texture corresponding to the texture 
address of the pixel to be drawn is calculated from the 
texture address, a read request is made only to the 
memory block, and the texture data is read. 
[0199] In this case, since there is no access for read- 
ing texture in memory blocks which do not store the cor- 
responding texture, it is possible to provide more time 
for access for drawing. 

[0200] In the same way for drawing, a memory block 
storing pixel data corresponding to a pixel address to be 
drawn is accessed to read the pixel data from the cor- 
responding address for modify writing, the data is mod- 
ified, then is written back to the same address. 
[0201] When performing hidden plane processing, 
again in the same way a memory block storing depth 
data corresponding to a pixel address to be drawn is 
accessed to read the depth data from the corresponding 
address for modify writing, the data is modified if nec- 
essary, then is written back to the same address. 
[0202] In such transfer of data with the DRAM 147 
based on the memory l/F circuit 144, the drawing per- 
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formance can be improved by performing the process- 
ing up to here in parallel. 

[0203] Especially, a plurality of pixels can be simulta- 
neously calculated by the means of partially increasing 
$ the operating frequency by providing the triangle DDA 
circuit 142 and the texture engine circuit 143 portions in 
the same circuit in a parallel execution mode (spatial 
parallel) or by inserting a lot of pipelines (time parallel). 
[0204] Also, the pixel data is arranged so that the ad- 
jacent portions in the display region are in different mod- 
ules under the control of the memory l/F circuit 144. 
[0205] Due to this, the processing is simultaneously 
carried out on a plane when drawing a plane such as a 
triangle. Therefore, the probability of operation of the re- 
spective DRAM modules is quite high. 
[0206] When displaying an image on a not illustrated 
CRT, in the CRT control circuit 145, the display address 
is generated in synchronization with the given horizontal 
and vertical synchronization frequencies and a request 
for transferring the display data is sent to the memory I/ 
F circuit 144. 

[0207] In the memory l/F circuit 144, in accordance 
with the request, a certain amount of the display data is 
transferred to the CRT control circuit 145. 
[0208] In the CRT control circuit 1 45, the display data 
is stored in a not illustrated display use FIFO etc. and 
index values of RGB are transferred to the RAM D AC 

146 at certain intervals. 

[0209] Note that when a request is made to the mem- 
ory l/F circuit 144 for reading data stored in the DRAM 

147 or in the SRAM 146 as explained above, the read 
address ADRR is input to the address converter 391 of 
the read controller 390. 

[0210] At this time, in the address converter 391, 
whether or not the address signals S350, S360, S370, 
and S380 from the memory controllers 350, 360, 370, 
and 380 are input as active is checked. When all of the 
address signals S350, S360, S370, and S380 are input 
as active, the read addresses and read request signal 
S391 are output to the respective memory controllers 
350, 360, 370, and 380 in order to read data in units of 
8 pixels or 1 6 pixels in response to the address signals 
S350, S360, S370, and S380. 

[0211] Receiving the read addresses and the read re- 
quest signal S391, the texture data, the (R, G, B) data, 
the z-data, and the texture color look-up table data are 
read in units of 8 pixels or 16 pixels in parallel via the 
wiring groups 401 R, 402R, 411 R, 41 2R, 421 R, 422R, 
431 R, and 432R of the read system in the memory con- 
trollers 350, 360, 370, and 380, then input to the data 
processing portion 392 via the wiring groups 351 , 361 , 
371, and 381 of the read system and the wiring group 
440. 

[0212] Then, predetermined processing is carried out 
in the data processing portion 392 and the result is out- 
put to the requested destination, such as the texture en- 
gine circuit 143 or the CRT control circuit 145. 
[021 3] In the RAMDAC 1 46, RGB values with respect 
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to RGB indexes are stored inside the RAM and the RGB 
values with respect to the index values are transferred 
to a not illustrated D/A converter. 
[021 4] Then : the RG B signal converted into an analog 
signal by the D/A converter is transferred to the CRT. 
[0215] As explained above, according to the present 
embodiment, since the built-in DRAM 147 inside a sem- 
iconductor chip is configured to store the display data 
and the texture data required by at least one graphic 
element, the texture data can be stored in portions other 
than the display regions. Therefore, the built-in DRAM 
can be used efficiently, and an image processing appa- 
ratus achieving both high speed operation and reduced 
power consumption can be realized. 
[0216] Further, a single memory system can be real- 
ized and all processing can be carried out inside. As a 
result, there is a large paradigm shift in terms of the ar- 
chitecture as well. 

[021 7] Also, since the memory can be efficiently used, 
the processing can be carried out by just the DRAM in- 
cluded inside. Therefore, the wide bandwidth between 
the memory and the drawing system resulting from the 
internal provision can be sufficiently utilized. Also, it be- 
comes possible to install specific processing in the 
DRAM. 

[0218] Also, by giving identical functions in the mem- 
ory circuit in parallel as a plurality of independent mod- 
ules, the efficiency of parallel operations is improved. 
When the number of bits of the data is simply large, the 
efficiency of use of data is deteriorated and improve- 
ment of the performance is limited to cases of certain 
conditions. To improve the average performance, bit 
lines can be efficiently used by providing a plurality of 
modules having certain degrees of functions. 
[021 9] Further efficient usage of the bit lines becomes 
possible by arranging the display elements at adjacent 
addresses to be different blocks of the DRAM from each 
other in the display address space. When there are fre- 
quent accesses to relatively fixed display regions as in 
drawing graphics, the probability increases of the mod- 
utes being able to perform processing simultaneously 
and the drawing performance can be improved. 
[0220] Further, since indexes of index colors and val- 
ues of a color look-up table therefor are stored inside 
the built-in DRAM 1 47 in order to store more texture da- 
ta, the texture data can be compressed and the built-in 
DRAM can be efficiently used. 

[0221] Also, since depth information of an object to be 
drawn is stored in the built-in DRAM, hidden plane 
processing can be performed simultaneously and in par- 
allel with the drawing. 

[0222] Normally, the drawn picture is desired to be 
displayed, however, since it is possible to store the tex- 
ture data and the display data together as a unified 
memory in the same memory system, the drawing data 
can be used as texture data instead of being used for 
direct display. 

[0223] This is effective when preparing the necessary 



texture data by drawing when necessary. This is also an 
effective function for preventing the amount of the tex- 
ture data from swelling. 

[0224] Also, by providing the DRAM inside the chip, 
s the high speed interface portion is completed just inside 
the chip, so it is no longer necessary to drive an I/O buff- 
er having a large additional capacity or a wiring capacity 
between chips. Therefore, the power consumption can 
be reduced compared with a not built-in case. 
16 [0225] Accordingly, a setup which uses a variety of 
techniques to enable everything to be accommodated 
in a single chip is becoming an essential technical ele- 
ment for future digital equipment such as portable data 
terminals. 

is [0226] Note that the present invention is not limited to 
the above embodiments. 

[0227] In the three-dimensional computer graphic 
system 10 shown in Fig. 1, the example was given of a 
configuration using an SRAM 14S, however, it can be 

20 configured without the SRAM 1 48 as well. 

[0228] Furthermore, in the three-dimensional compu- 
ter graphic system 10 shown in Fig. 1, the example was 
given wherein the geometric processing for generating 
the polygon rendering data was performed in the main 

25 processor 11 , but the processing can be carried out in 
the rendering circuit 14 as well. 

[0229] As explained above, according to the present 
invention, since the memory circuit provided along with 
the logic circuit inside a semiconductor chip is config- 

30 ured to store the display data and the texture data re- 
quired by at least one graphic element, the texture data 
can be stored in portions other than the display regions, 
so the built-in memory circuit can be used efficiently and 
an image processing apparatus achieving both high 

3S speed operation and reduced power consumption can 
be realized. 

[0230] Further, a single memory system can be real- 
ized and all processing can be carried out inside. As a 
result, there is a large paradigm shift in terms of the ar- 

40 chttecture as well. 

[0231] Also, since the memory can be efficiently used, 
the processing can be carried out by just the memory 
included inside. Therefore, the wide bandwidth between 
the memory and the drawing system resulting from the 

45 internal provision can be sufficiently utilized. Also, it be- 
comes possible to install specific processing in the 
memory. 

[0232] Also, by giving identical functions in the mem- 
ory circuit in parallel as a plurality of independent mod- 

50 ' ules, the efficiency of parallel operations is improved. 
[0233] Furthermore, since the display elements at ad- 
jacent addresses are arranged to be in different memory 
blocks to each other in the display address space, when 
there are frequent accesses to relatively fixed display 

55 regions as in drawing graphics, the probability increases 
of the respective modules being able to perform the 
processing simultaneously, so the drawing performance 
can be improved. 
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[0234] Further, since indexes of index colors and val- 
ues of a color look-up table therefor are stored inside 
the built-in memory in order to store more texture data, 
the texture data can be compressed and the built-in 
memory can be efficiently used. 

[0235] Also, since depth information of an object to be 
drawn is stpred in the built-in memory, hidden plane 
processing can be performed simultaneously and in par- 
allel with the drawing. 

[0236] Further, since it is possible to store the texture 
data and the display data together as a unified memory 
in the same memory system, the drawing data can be 
used as texture data instead of being used for direct dis- 
play. 

[0237] Also, by providing the memory inside the chip, 
the high speed interface portion is completed just inside 
the chip, so it is no longer necessary to drive an I/O buff- 
er having a large additional capacity or a wiring capacity 
between chips. Therefore, the power consumption can 
be reduced compared with a not built-in case. 
[0238] While the invention has been described with 
reference to specific embodiment chosen for purpose of 
illustration, it should be apparent that numerous modifi- 
cations could be made thereto by those skilled in the art 
without departing from the basic concept and scope of 
the invention. 



Claims 

1. An image processing apparatus comprising: 

a memory circuit (1 47) for storing display data 
and texture data required by at least one graph- 
ic element and 

a logic circuit for performing processing for ap- 
plying the texture data on the surface of the 
graphic element of the display data based on 
the stored data of the memory circuit (147), 
the memory circuit (147) and the logic circuit, 
being both accommodated in one semiconduc- 
tor chip (14), 

2. An image processing apparatus as set 1orth in claim 

1, wherein: 

the memory circuit (147) is divided into a plu- 
rality of modules (200, 210, 220, 230) having 
identical functions; and 

the logic circuit accesses the modules (200, 
210, 220, 230) in parallel. 

3. An image processing apparatus as set forth in claim 

2, wherein display elements at adjacent addresses 
in a display address space are arranged to be dif- 
ferent memory blocks in the memory circuit (147). 

4. An image processing apparatus as set forth in claim 
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1 , wherein indexes in index colors and values of a 
color look-up table for referring to colors are stored 
in the memory circuit (147). 

5, An image processing apparatus as set forth in claim 
1 , wherein information of a depth of an object to be 
drawn is stored in the memory circuit (147). 

6, An image processing apparatus for performing ren- 
dering by receiving polygon rendering data includ- 
ing three-dimensional coordinates (x, y. z), Ft (red), 
G (green), and B (blue), homogeneous coordinates 
(s, t) of a texture, and a homogeneous term q with 
respect to vertexes of a unit graphic, at least com- 
prising: 

a memory circuit (147) for storing display data 
and texture data required by at least one graph- 
ic element; 

an interpolation data generation circuit (142) for 
interpolating the polygon rendering data of ver- 
texes of the unit graphic to generate interpola- 
tion data of pixels positioned inside the unit 
graphic; and 

a texture processing circuit (143) for dividing 
the homogeneous coordinates (s, t) of a texture 
included in the interpolation data by the homo- 
geneous term q to generate "s/q" and "t/q", us- 
ing a texture address corresponding to the "s/ 
q" and "x/q" to read texture data from the mem- 
ory circuit, and applying the texture data on the 
surface of the graphic element of the display 
data; 

the memory circuit (147), the interpolation data 
generation circuit (142), and the texture 
processing circuit (143) being accommodated 
in one semiconductor chip (14). 

7, An image processing apparatus as set forth in claim 
6, wherein the memory circuit (147) is divided into 
a plurality of modules (200, 210, 220, 230) having 
identical functions and the modules (200, 210, 220, 
230) are accessed in parallel. 



45 8. An image processing apparatus as set forth in claim 
7, wherein display elements at adjacent addresses 
in a display address space are arranged to be dif- 
ferent memory blocks in the memory circuit (147). 

50 9. An image processing apparatus as set forth in claim 
6, wherein indexes in index colors and values of a 
color look-up table 1or referring to colors are stored 
in the memory circuit (147). 

55 10. An image processing apparatus as set forth in claim 
6, wherein information of a depth of an object to be 
drawn is stored in the memory circuit (147). 
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